3 <110> APPLICANT: Wang, Yan 

4 Xi, Lei 

5 Prosen, Dennis E. 

6 MJ Bioworks, Inc. 

8 <120> TITLE OF INVENTION: Improved Nucleic Acid Modifying Enzymes 

10 <130> FILE REFERENCE: 020130-000111US 

12 <140> CURRENT APPLICATION NUMBER: US 09/870,353A 

13 <141> CURRENT FILING DATE: 2001-05-30 

15 <150> PRIOR APPLICATION NUMBER: US 60/207,567 

16 <151> PRIOR FILING DATE: 2000-05-26 

18 <150> PRIOR APPLICATION NUMBER: US 09/640,958 

19 <151> PRIOR FILING DATE: 2000-08-16 
21 <160> NUMBER OF SEQ ID NOS : 34 
23 <170> SOFTWARE: PatentIn Ver. 2.1 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 189 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Artificial Sequence 

30 <220> FEATURE: 

31 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 

32 sequence non-specific double-stranded nucleic acid 

33 binding protein Ssod7d 

35 <220> FEATURE: 

36 <221> NAME/KEY: CDS 

37 <222> LOCATION: (1)..(189) 

38 <223> OTHER INFORMATION: Ssod7d 

40 <400> SEQUENCE: 1 

41 gcaaccgtaa agttcaagta caaaggcgaa gaaaaagagg tagacatctc caagatcaag 60 

42 aaagtatggc gtgtgggcaa gatgatctcc ttcacctacg acgagggcgg tggcaagacc 120 

43 ggccgtggtg cggtaagcga aaaggacgcg ccgaaggagc tgctgcagat gctggagaag 180 

44 cagaaaaag 189 

47 <210> SEQ ID NO: 2 

48 <211> LENGTH: 63 
49 <212> TYPE: PRT 

50 <213> ORGANISM: Artificial Sequence 

52 <220> FEATURE: 

53 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 

54 sequence non-specific double-stranded nucleic acid 

55 binding protein Ssod7d 

57 <400> SEQUENCE: 2 

58 Ala Thr Val Lys Phe Lys Tyr Lys Gly Glu Glu Lys Glu Val Asp Ile 

59 1 5 10 15 

60 Ser Lys Ile Lys Lys Val Trp Arg Val Gly Lys Met Ile Ser Phe Thr 

61 20 25 30 



62 Tyr Asp Glu Gly Gly Gly Lys Thr Gly Arg Gly Ala Val Ser Glu Lys 

63 35 40 45 

64 Asp Ala Pro Lys Glu Leu Leu Gln Met Leu Glu Lys Gln Lys Lys 

65 50 55 60 

68 <210> SEQ ID NO: 3 

69 <211> LENGTH: 1899 

70 <212> TYPE: DNA 

71 <213> ORGANISM: Artificial Sequence 

73 <220> FEATURE: 

74 <223> OTHER INFORMATION: Description of Artificial Sequence : fusion protein 

75 Ssod7d-deltaTaq 

77 <220> FEATURE: 

78 <221> NAME/KEY: CDS 

79 <222> LOCATION: (1)..(1899) 

80 <223> OTHER INFORMATION: Ssod7d-deltaTaq 

82 <400> SEQUENCE: 3 

83 atgattacga attcgagcgc aaccgtaaag ttcaagtaca aaggcgaaga aaaagaggta 60 

84 gacatctcca agatcaagaa agtatggcgt gtgggcaaga tgatctcctt cacctacgac 120 

85 gagggcggtg gcaagaccgg ccgtggtgcg gtaagcgaaa aggacgcgcc gaaggagctg 180 

86 ctgcagatgc tggagaagca gaaaaagggc ggcggtgtca ctagtcccaa ggccctggag 240 

87 gaggccccct ggcccccgcc ggaaggggcc ttcgtgggct ttgtgctttc ccgcaaggag 300 

88 cccatgtggg ccgatcttct ggccctggcc gccgccaggg ggggccgggt ccaccgggcc 360 

89 cccgagcctt ataaagccct cagggacctg aaggaggcgc gggggcttct cgccaaagac 420 

90 ctgagcgttc tggccctgag ggaaggcctt ggcctcccgc ccggcgacga ccccatgctc 480 

91 ctcgcctacc tcctggaccc ttccaacacc acccccgagg gggtggcccg gcgctacggc 540 

92 ggggagtgga cggaggaggc gggggagcgg gccgcccttt ccgagaggct cttcgccaac 600 

93 ctgtggggga ggcttgaggg ggaggagagg ctcctttggc tttaccggga ggtggagagg 660 

94 cccctttccg ctgtcctggc ccacatggag gccacggggg tgcgcctgga cgtggcctat 720 

95 ctcagggcct tgtccctgga ggtggccgag gagatcgccc gcctcgaggc cgaggtcttc 780 

96 cgcctggccg gccacccctt caacctcaac tcccgggacc agctggaaag ggtcctcttt 840 

97 gacgagctag ggcttcccgc catcggcaag acggagaaga ccggcaagcg ctccaccagc 900 

98 gccgccgtcc tggaggccct ccgcgaggcc caccccatcg tggagaagat cctgcagtac 960 

99 cgggagctca ccaagctgaa gagcacctac attgacccct tgccggacct catccacccc 1020 

100 aggacgggcc gcctccacac ccgcttcaac cagacggcca cggccacggg caggctaagt 1080 

101 agctccgatc ccaacctcca gaacatcccc gtccgcaccc cgcttgggca gaggatccgc 1140 

102 cgggccttca tcgccgagga ggggtggcta ttggtggccc tggactatag ccagatagag 1200 

103 ctcagggtgc tggcccacct ctccggcgac gagaacctga tccgggtctt ccaggagggg 1260 

104 cgggacatcc acacggagac cgccagctgg atgttcggcg tcccccggga ggccgtggac 1320 

105 cccctgatgc gccgggcggc caagaccatc aacttcgggg tcctctacgg catgtcggcc 1380 

106 caccgcctct cccaggagct agccatccct tacgaggagg cccaggcctt cattgagcgc 1440 

107 tactttcaga gcttccccaa ggtgcgggcc tggattgaga agaccctgga ggagggcagg 1500 

108 aggcgggggt acgtggagac cctcttcggc cgccgccgct acgtgccaga cctagaggcc 1560 

109 cgggtgaaga gcgtgcggga ggcggccgag cgcatggcct tcaacatgcc cgtccagggc 1620 

110 accgccgccg acctcatgaa gctggctatg gtgaagctct tccccaggct ggaggaaatg 1680 

111 ggggccagga tgctccttca ggtccacgac gagctggtcc tcgaggcccc aaaagagagg 1740 

112 gcggaggccg tggcccggct ggccaaggag gtcatggagg gggtgtatcc cctggccgtg 1800 

113 cccctggagg tggaggtggg gataggggag gactggctct ccgccaagga gggcattgat 1860 

114 ggccgcggcg gaggcgggca tcatcatcat catcattaa 1899 
117 <210> SEQ ID NO: 4 

118 <211> LENGTH: 632 

119 <212> TYPE: PRT 

120 <213> ORGANISM: Artificial Sequence 

122 <220> FEATURE: 

123 <223> OTHER INFORMATION: Description of Artificial Sequence : fusion protein 

124 Ssod7d-deltaTaq 

126 <400> SEQUENCE: 4 

127 Met Ile Thr Asn Ser Ser Ala Thr Val Lys Phe Lys Tyr Lys Gly Glu 

128 1 5 10 15 

129 Glu Lys Glu Val Asp Ile Ser Lys Ile Lys Lys Val Trp Arg Val Gly 

130 20 25 30 

131 Lys Met Ile Ser Phe Thr Tyr Asp Glu Gly Gly Gly Lys Thr Gly Arg 

132 35 40 45 

133 Gly Ala Val Ser Glu Lys Asp Ala Pro Lys Glu Leu Leu Gln Met Leu 

134 50 55 60 

135 Glu Lys Gln Lys Lys Gly Gly Gly Val Thr Ser Pro Lys Ala Leu Glu 

136 65 70 75 80 

137 Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu 

138 85 90 95 

139 Ser Arg Lys Glu Pro Met Trp Ala Asp Leu Leu Ala Leu Ala Ala Ala 

140 100 105 110 

141 Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg 

142 115 120 125 

143 Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu 

144 130 135 140 

145 Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu 

146 145 150 155 160 

147 Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 

148 165 170 175 

149 Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala 

150 180 185 190 

151 Leu Ser Glu Arg Leu Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu 

152 195 200 205 

153 Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala 

154 210 215 220 

155 Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr 

156 225 230 235 240 

157 Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu Ile Ala Arg Leu Glu 

158 245 250 255 

159 Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 

160 260 265 270 

161 Asp Gln Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro Ala Ile 

162 275 280 285 

163 Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 

164 290 295 300 

165 Glu Ala Leu Arg Glu Ala His Pro Ile Val Glu Lys Ile Leu Gln Tyr 

166 305 310 315 320 

167 Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Pro Leu Pro Asp 

168 325 330 335 

169 Leu Ile His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gln Thr 

170 340 345 350 

171 Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln Asn 

172 355 360 365 

173 Ile Pro Val Arg Thr Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe Ile 

174 370 375 380 

175 Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gln Ile Glu 

176 385 390 395 400 

177 Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu Ile Arg Val 

178 405 410 415 

179 Phe Gln Glu Gly Arg Asp Ile His Thr Glu Thr Ala Ser Trp Met Phe 

180 420 425 430 

181 Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 

182 435 440 445 

183 Thr Ile Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 

184 450 455 460 

185 Gln Glu Leu Ala Ile Pro Tyr Glu Glu Ala Gln Ala Phe Ile Glu Arg 

186 465 470 475 480 

187 Tyr Phe Gln Ser Phe Pro Lys Val Arg Ala Trp Ile Glu Lys Thr Leu 

188 485 490 495 

189 Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 

190 500 505 510 

191 Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala 

192 515 520 525 

193 Ala Glu Arg Met Ala Phe Asn Met Pro Val Gln Gly Thr Ala Ala Asp 

194 530 535 540 

195 Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met 

196 545 550 555 560 

197 Gly Ala Arg Met Leu Leu Gln Val His Asp Glu Leu Val Leu Glu Ala 

198 565 570 575 

199 Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met 

200 580 585 590 

201 Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Ile 

202 595 600 605 

203 Gly Glu Asp Trp Leu Ser Ala Lys Glu Gly Ile Asp Gly Arg Gly Gly 

204 610 615 620 

205 Gly Gly His His His His His His 

206 625 630 

209 <210> SEQ ID NO: 5 

210 <211> LENGTH: 2763 

211 <212> TYPE: DNA 

212 <213> ORGANISM: Artificial Sequence 

214 <220> FEATURE: 

215 <223> OTHER INFORMATION: Description of Artificial Sequence : fusion protein 

216 Ssod7d/full-length Taq 

218 <220> FEATURE: 

219 <221> NAME/KEY: CDS 

220 <222> LOCATION: (1)..(2763) 
221 <223> OTHER INFORMATION: Ssod7d/full-length Taq 

223 <400> SEQUENCE: 5 

224 atgattacga attcgagcgc aaccgtaaag ttcaagtaca aaggcgaaga aaaagaggta 60 

225 gacatctcca agatcaagaa agtatggcgt gtgggcaaga tgatctcctt cacctacgac 120 

226 gagggcggtg gcaagaccgg ccgtggtgcg gtaagcgaaa aggacgcgcc gaaggagctg 180 

227 ctgcagatgc tggagaagca gaaaaagggc ggcggtgtca ctagtgggat gctgcccctc 240 

228 tttgagccca agggccgggt cctcctggtg gacggccacc acctggccta ccgcaccttc 300 

229 cacgccctga agggcctcac caccagccgg ggggagccgg tgcaggcggt ctacggcttc 360 

230 gccaagagcc tcctcaaggc cctcaaggag gacggggacg cggtgatcgt ggtctttgac 420 

231 gccaaggccc cctccttccg ccacgaggcc tacggggggt acaaggcggg ccgggccccc 480 

232 acgccagagg actttccccg gcaactcgcc ctcatcaagg agctggtgga cctcctgggg 540 

233 ctggcgcgcc tcgaggtccc gggctacgag gcggacgacg tcctggccag cctggccaag 600 

234 aaggcggaaa aggagggcta cgaggtccgc atcctcaccg ccgacaaaga cctttaccag 660 

235 ctcctttccg accgcatcca cgtcctccac cccgaggggt acctcatcac cccggcctgg 720 

236 ctttgggaaa agtacggcct gaggcccgac cagtgggccg actaccgggc cctgaccggg 780 

237 gacgagtccg acaaccttcc cggggtcaag ggcatcgggg agaagacggc gaggaagctt 840 

238 ctggaggagt gggggagcct ggaagccctc ctcaagaacc tggaccggct gaagcccgcc 900 

239 atccgggaga agatcctggc ccacatggac gatctgaagc tctcctggga cctggccaag 960 

240 gtgcgcaccg acctgcccct ggaggtggac ttcgccaaaa ggcgggagcc cgaccgggag 1020 

241 aggcttaggg cctttctgga gaggcttgag tttggcagcc tcctccacga gttcggcctt 1080 

242 ctggaaagcc ccaaggccct ggaggaggcc ccctggcccc cgccggaagg ggccttcgtg 1140 

243 ggctttgtgc tttcccgcaa ggagcccatg tgggccgatc ttctggccct ggccgccgcc 1200 

244 agggggggcc gggtccaccg ggcccccgag ccttataaag ccctcaggga cctgaaggag 1260 

245 gcgcgggggc ttctcgccaa agacctgagc gttctggccc tgagggaagg ccttggcctc 1320 

246 ccgcccggcg acgaccccat gctcctcgcc tacctcctgg acccttccaa caccaccccc 1380 

247 gagggggtgg cccggcgcta cggcggggag tggacggagg aggcggggga gcgggccgcc 1440 

248 ctttccgaga ggctcttcgc caacctgtgg gggaggcttg agggggagga gaggctcctt 1500 

249 tggctttacc gggaggtgga gaggcccctt tccgctgtcc tggcccacat ggaggccacg 1560 

250 ggggtgcgcc tggacgtggc ctatctcagg gccttgtccc tggaggtggc cgaggagatc 1620 

251 gcccgcctcg aggccgaggt cttccgcctg gccggccacc ccttcaacct caactcccgg 1680 

252 gaccagctgg aaagggtcct ctttgacgag ctagggcttc ccgccatcgg caagacggag 1740 

253 aagaccggca agcgctccac cagcgccgcc gtcctggagg ccctccgcga ggcccacccc 1800 

254 atcgtggaga agatcctgca gtaccgggag ctcaccaagc tgaagagcac ctacattgac 1860 

255 cccttgccgg acctcatcca ccccaggacg ggccgcctcc acacccgctt caaccagacg 1920 

256 gccacggcca cgggcaggct aagtagctcc gatcccaacc tccagaacat ccccgtccgc 1980 

257 accccgcttg ggcagaggat ccgccgggcc ttcatcgccg aggaggggtg gctattggtg 2040 

258 gccctggact atagccagat agagctcagg gtgctggccc acctctccgg cgacgagaac 2100 

259 ctgatccggg tcttccagga ggggcgggac atccacacgg agaccgccag ctggatgttc 2160 

260 ggcgtccccc gggaggccgt ggaccccctg atgcgccggg cggccaagac catcaacttc 2220 

261 ggggtcctct acggcatgtc ggcccaccgc ctctcccagg agctagccat cccttacgag 2280 

262 gaggcccagg ccttcattga gcgctacttt cagagcttcc ccaaggtgcg ggcctggatt 2340 

263 gagaagaccc tggaggaggg caggaggcgg gggtacgtgg agaccctctt cggccgccgc 2400 

264 cgctacgtgc cagacctaga ggcccgggtg aagagcgtgc gggaggcggc cgagcgcatg 2460 

265 gccttcaaca tgcccgtcca gggcaccgcc gccgacctca tgaagctggc tatggtgaag 2520 

266 ctcttcccca ggctggagga aatgggggcc aggatgctcc ttcaggtcca cgacgagctg 2580 

267 gtcctcgagg ccccaaaaga gagggcggag gccgtggccc ggctggccaa ggaggtcatg 2640 

268 gagggggtgt atcccctggc cgtgcccctg gaggtggagg tggggatagg ggaggactgg 2700 

269 ctctccgcca aggagggcat tgatggccgc ggcggaggcg ggcatcatca tcatcatcat 2760 

270 taa 2763 